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Abstract 

We follow the works of Puzynina and Zamboni, and Rigo et al. on abelian re- 
turns in Sturmian words. We determine the cardinality of the set AVlZu of 
abelian returns of all prefixes of a Sturmian word u in terms of the coefficients 
of the continued fraction of the slope, dependingly on the intercept. We pro- 
vide a simple algorithm for finding the set AP1Z U and we determine it for the 
characteristic Sturmian words. 



1. Introduction 

Although Sturmian sequences have been studied for more than 70 years, 
new properties and characterizations still appear. Return words, introduced by 
Durand [8], were used in 2001 for one of such equivalent definitions. Vuillon [20] 
has shown that an infinite word is a Sturmian word if and only if every its 
factor has exactly two return words. In 2011, Puzynina and Zamboni [14] use 
an abelian modification of the notion of return words for deriving yet another 
equivalent characterization of Sturmian words. The adjective 'abelian' is used 
when in a word w, we are interested only in the number of occurrences of a 
letter a and not in the order of letters. Formally, two finite words u>, w' over an 
alphabet A arc abelian equivalent, denoted w ^ ab w' , if |u>| a = |w'| a for every 
a £ A, where \w\ a stands for the number of occurrences of the letter a in the 
word w. In order to define abelian return words to a factor w of length |u>| = n 
in an infinite word u = UQU1U2 • • • , denote by n\ < • • • < ni < < • • • 

the consecutive occurrences of factors which are abelian equivalent to w, i.e. 
u ni u nz+ i ■ ■ ■ u„ i+ „_i — ab w, and UjU j+1 ■ ■ ■ u j+n -i 7^ ab w for j ^ ni. Then the 
factors v = u ni u ni+ i ■ ■ ■ u nj+1 _i arc called abelian return words to w. Given a 
factor w in u, the set of abelian return words of w in u is denoted by AlZ WtVL . 
The main result of Puzynina and Zamboni is the following. 
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Theorem 1.1 ( [14]). An aperiodic recurrent infinite word u is Sturmian if and 
only if every factor of u has two or three abelian return words, i.e. #ATZ WtU £ 
{2, 3} for any factor w in u. 

If a factor has three abelian return words Ri, R2, R3, then for their length 
one has |i?i| + I-R2I = I-R3I, see [15]. We will show that in fact R3 — R1R2 (cf. 
Theorem 4.5). 

Rigo ct al. [17] have studied the set of abelian returns to all prefixes of a 
Sturmian word u, denoted by AVTZ U = [J {A1Z W ^ U : w is a prefix of u}. 

Theorem 1.2 ([17]). Let u be a Sturmian word with intercept p. Then AP1Z U 
is finite if and only if p =/= 0. 

In Theorem 4.13 we determine the cardinality of the set AVTZ U dependingly 
on the intercept p and the continued fraction of the slope a. We provide an 
algorithm for listing the elements of AV7l u explicitly. We perform the study for 
every characteristic Sturmian word (Proposition 6.1), extending thus the result 
of Rigo et. al [17] who show that AVK f = {0, 1, 01, 10, 001} for the Fibonacci 
word f . 

For our purposes, we present Sturmian words as codings of exchange T : 
[0, 1) — > [0, 1) of two intervals. The main tool for obtaining our results is the 
study of itineraries under the first return map induced by T to a subinterval 



2. Some facts about Sturmian words 

Sturmian words are usually defined as infinite aperiodic words having for 
every n £ N exactly n + 1 different factors of length n. For details about 
definition and basic properties of Sturmian words sec [12]. For our purposes, 
it is useful to consider the equivalent characterization of Sturmian words as 
codings of exchange of two intervals. For an a £ (0, 1) we consider the exchange 
T of two intervals of length a and 1 — a, namely T : [0, 1) — s- [0, 1) given by the 
prescription, 



If a is irrational, then the orbit of any given p £ [0, 1) is infinite, and the sequence 
p, T(p), T 2 (p), T 3 (p), . . . can be coded by an infinite word u a ,p = uqU\U2 ■ ■ ■ 
over the alphabet {0, 1} given naturally by 



Such an infinite word is Sturmian with slope a and intercept p. It is well known 
that any Sturmian word can be obtained by coding an orbit of a 2-interval 
exchange for some irrational slope a and intercept p, or by a similar exchange 
of intervals where the intervals Jo, J\ are semi-open from the other side. 



ic [0,1). 




x + 1 — a if x £ [0, a) =: Jq , 
x — a if x € [a, 1) =: Ji . 



(1) 
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All Sturmian words u QjP of the same slope a have the same language £(a), 
i.e. the same set of factors. Among Sturmian words with the same slope, one is 
exceptional - the so-called characteristic word - namely the one with intercept 
p = 1 — a. We denote u Qi i_ Q = c a . Among the exceptional properties of c a is 
that every prefix w of c Q is a left-special factor, i.e. both Ow and lw belong to 
the language C(a). 

For any factor w £ C(a), there is an interval J w C [0, 1), the cylinder of w, 
such that the prefix of length n of the infinite word u a ,p is equal to w, taking 
any intercept p £ J w . The n + 1 subintervals J w for w £ C(a) n {0, 1}" form a 
partition of [0, 1) and their boundary points are determined by the n numbers 
a, T-^a), ...,T-"+V). 

For every fixed n, the lengths of intervals J w take at most 3 values, where 
the longest is the sum of the two shorter ones. This is the statement of the so- 
called three-gap theorem, which was independently proved by several authors, 
for example [18, 19]. The length of J w corresponds to the frequency of the factor 
w in u atX , as mentioned in [7]. These lengths take one of the values - here 
denoted by 8k, s - in a discrete set, which is described in terms of the continued 
fraction of the parameter a — [0,ai,a2,a3, . . .]. For an overview about three 
gap theorem and related results, see [2]. The values 5k, s are important for our 
further considerations, that is why we provide them here explicitly. Recall that 
the numerators pk and the denominators q k of the convergents ^ of a satisfy 
the recurrence relation pk = a^Pk-i + Pk-2, Qk = a>k<lk-i + Qk-2, with initial 
values pq = ciq = 0, go = 1, and p_i = 1, g_i = 0, so that the recurrence holds 
for every k > 1. Denoting 

h,8 ■= \( s - l )(Pk - aq k ) +p k -i - aqk-i\ , for k > 0, 1 < s < a k+1 , (2) 
one has 

5k, s < 8k', S ' <=> ks >- lei k's' . 

It is known [13] that Sturmian words are balanced, which means that for 
every pair of factors w, w' of the same length \w\ = \w'\ one has | \w\q— \w'\q\ < 1. 
This implies that for any n, the number \w\o of letters in a factor w of length 
n can take only two values. In accordance with [17], we call the factors w with 
higher number |iy|o light, and the other ones heavy. 

It can be shown that the union of intervals J w over all light factors w is 
again an interval. The same is true for heavy factors. The statement of the 
following lemma can be derived for example from the proof of Lemma 8 of [3] 
or the proof of Theorem 8 of [17]. 

Lemma 2.1. A prefix of length n of the infinite word Uq, iP is light if and only 
if p £ [0, T~ n+1 (a)^ . A prefix of length n of the infinite word u 0yP is heavy if 
and only if p £ [T-' n+1 (a), l) . 

3. Abelian returns of Sturmian factors 

The main tool for describing abelian return to prefixes of Sturmian words is 
the study of first return map to a subinterval / of [0, 1) under the transformation 
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T from (1). 

Let T be an exchange of two intervals as in (1). To every subinterval I C 
[0, 1) we can associate a mapping r : / — > {1, 2, 3, . . . }, the so-called return time 
to /, by setting 

r(x) = min{n G N, n > 1 : T n (x) G /} . 

The prefix of length r(x) of the Sturmian word u a x coding the orbit of the 
point x is called an /-itinerary under T and denoted R(x). 

With these notions we can formulate the connection of abclian returns to a 
prefix of a word u Q p to the /-itineraries following from Lemma 2.1. 

Lemma 3.1. Let w be a factor of a Sturmian word u a p of length \w\ = n 
for some n G N. Put I = [0,T-" +1 (a)) if w is light and I = [T-" +1 (a), l) 
otherwise. Then v is an abelian return to w if and only if v is an I -itinerary. 

As a consequence we have the following description of the set APlZ Ua . 

Corollary 3.2. Let a, p G [0,1), a irrational. Let u be a Sturmian word with 
slope a and intercept p. Then the set of abelian return words of prefixes of u 
satisfies 

avu u = n a p \j -r! c ; , 

where 

IZp = l^J {R : R is a [0, (3) -itinerary} , 

P</3<1 

TZ'p = [J {R : R is a [7, l)-itinerary} . 

0<7<p 

Proof. Let w be a prefix of length n of the word u which is light. This means that 
for its intercept p one has p G J w C [0, T~ rl+1 (a)) . According to Lemma 3.1, if 
v is an abelian return to w, then it is an /-itinerary, where / = [0, T _n+1 (a)) . 
Thus v G Tip*. Similarly, if w is a heavy prefix of length n of the word u, we 
derive that its abelian return words satisfy v G 7t' p a . 

For the other inclusion, we will use the following claim which follows from 
the properties of the transformation T, see [11]. 

Fact 3.3. Let a G (0,1) be irrational and let T be defined by (1). Denote 
Sp = {R : R is a [0, (3) -itinerary} . Then for any (3q G (0,1) there exists a 
neighbourhood Hp such that Sp C Sp for any (3 G Hp . 

Let p < flo < 1. If f3o = T~ n+1 (a) for some a, then the set Sp of I- 
itineraries for the interval / = [0, /3o) = [0, T~ n+1 (a)) is, according to Lemma 3.1, 
formed by abelian returns to the prefix of u of length n. If (3q 7^ T~ n+1 (a) for all 
n G N, we use the fact that {T~ n (a) : n G N} is dense in [0, 1). By Fact 3.3, wc 
find 13 = T- n+1 {a) sufficiently close to /? , so that S/3 C Sp. Since Sp C AVTZ U , 
the proof is established. □ 
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Remark 3.4. Note that TZ" is the set of all abelian returns to all light prefixes 
of the Sturmian word VL a ,p- Similarly, 71?™ is the set of all abelian returns to all 
heavy prefixes of the Sturmian word u ajP . 

4. First return map for Sturmian systems 

In the previous section we have explained that for determining the set AVTZ U 
for a given Sturmian word u, it is important to derive what are the /-itineraries 
under the transformation (1), in particular, for intervals I of the type / = [0, j3) 
rcsp. / = [7, 1). We provide such description in Theorem 4.5. 

First we recall the notion of fc-interval exchange transformations. Let Jo U 
JiU- ■ -U Jk-i = [0, 1) be a partition of [0, 1) into intervals closed from the left and 
open from the right. Let to, t\, . . . , t^-i be constants such that the mapping T 
defined by T(x) = x + tj for x E Jj is a bijection T : [0, 1) — > [0, 1). For example, 
the transformation (1) is an exchange of two intervals. Clearly, intervals T(Jo), 
T( Ji), . . . , T(Jfc_i) also form a partition of [0, 1). Their ordering in the interval 
[0, 1) is usually specified by a permutation on {0, 1, . . . , k — 1}. 

For a general exchange T : [0, 1) — > [0, 1) of k intervals it was proven in [11] 
that the induced mapping Tj : I — > / given by T/(.t) = T r< - X \x) is an exchange 
of at most k + 2 intervals. For k = 3, this result can be stated in a stronger 
form: If T is an exchange of three intervals with permutation (321), then for 
any interval /, the induced map Tj is cither again an exchange of three intervals 
with permutation (321), or exchange of two intervals, see Theorem 4.1 in [4]. 
In this paper, T is an exchange of two intervals. The following statement about 
induced maps is a reformulation of Proposition 4.5. of [10]. 

Proposition 4.1. Let T : [0,1) [0,1) be given by (1). Let I = [c,c + 5), 
where < c < c + 8 <1. Then the induced map Tj is 

• an exchange of two intervals, if 6 — 6k. s for some k > 0, 1 < s < a^+i, 
where Sk.s is defined in (2); 

• an exchange of three intervals with permutation (321), otherwise. 

As it was already mentioned, the values 5k, s represent lengths of cylinders J w 
of factors w of the Sturmian word. In particular, choosing I — J w , the induced 
map Tj is an exchange of two intervals, and the return time r(x) therefore takes 
two values for x G /. The result of Vuillon [20] states a stronger fact, namely 
that the /-itineraries R(x) take also only two values for x G /, and these are 
the classical return words to the factor w. If / C [0, 1) is chosen arbitrarily, the 
above theorem implies that the return time r(x) for x £ / takes at most three 
values. The /-itinerary R(x) can however take, in general, more values than 
three. For, if R(x) 7^ R(y), but R(x) and R(y) are still abelian equivalent, then 
r(x) = r(y). 
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Example 4.2. Let a = -, where t = g(l + is the golden ratio. In this 
case, the transformation T : [0, 1) — > [0, 1) is of the form 

<f*e[i,i). 

Consider the interval I = [-\, ^ + -4r). Define 

h = [73,72), ^2 = [72, 72 + 7V ), ^3 = [72 + 75 , 7), A = [7, 7 + 74)- 

Lei us fot /or every subinterval Ij the corresponding return time r(x), the in- 
duced map Ti(x), and the I -itinerary R{x), where x G Ij: 



x e h 


r(x) = 1 


Tl (x)=x+^ 


R(x) = 


X € 1 2 


r(x) = 3 


Ti(x) = x+± 


i?(x) = 010 


x G h 


r(x) = 2 


Tj(x) = x- 7V 


R(x) = 01 


X £ I4 


r(x) = 2 


7>(x) = x- ^ 


R(x) = 10 



We see that the induced map Tj is an exchange of three intervals I\, I2, and 
I3 U I4 with permutation (321). However, the I -itineraries are four different 
words. 

In the previous example we have seen that for a general subinterval I C [0,1), 
the set of /-itineraries may have four elements. We will focus on the case when 
/ has the form / = [0, 0) for some f3 < 1 and show that then 

#S p = : R is an [0, ^-itinerary} < 3 . 

Before that, we give examples of the most simple cases of the choice of /. 

Example 4.3. Let I = [0, j3), where max{a, 1 — a} < f3 < 1. We determine all 
I -itineraries by inspecting Figure 1. 
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The interval I splits into three subintervals, 

h = [0,/3 + a-l), I 2 = [/3 + a-l,a), l a = [a,0), 

for which the return time and the itinerary is constant. We list them together 
with the induced map in the following table. 



x e h 


r(x) = 1 


Ti{x) = x + 1 — a 


R(x) = 


X G 1 2 


r(x) = 2 


T^x) =x + l-2a 


R{x) = 01 


x G h 


r(x) = 1 


Ti(x) = x — a 


R(x) = 1 



Ifm&x{a, 1 — a} < /3 < 1, then the induced map Tj is an exchange of three 
intervals with permutation (321) of the form 

!x+l-a if x G [0, (i + a - 1) , 
x + l-2a ifx G IP + a- l,a), (3) 
x — a if x G [a, /?) . 

For (3 = 1, we obtain from Tj the original transformation T with I -itineraries 
0, 1. It is also interesting to consider the other extreme value j3 = max{a, 1 — a}, 
where the induced map Tj is again an exchange of two intervals, see Example 5. 1 . 

Example 4.4. Example 4-3 is a special case of the following situation: Let 
I = [0,/3) where a < (3 < 1. For determining all I -itineraries, denote by I the 
non-negative integer such that 

1 - (I + l)a< f3 < 1 -la. 

The interval I splits into three subintervals, 

h = [0,(3 + (l + l)a-l), I 2 = {(3 + (l + l)a-l,a), I 3 = [a,p), 

for which the return time and the itinerary are constant, as seen in the table 
below. 



x G h 


r(x) =1 + 1 


Tj(x) =x + l-(l + l)a 


R(x) = 01' 


x G h 


r(x) =1 + 2 


Tj(x) =x + l- {l + 2)a 


R(x) = 0l l+1 


xel 3 


r(x) = 1 


Tj{x) = x — a 


R(x) = 1 



When (3 = 1 — la, the interval [f3 + (I + l)a — 1, a) vanishes, and thus the 
induced map Tj is an exchange of two intervals. Otherwise, it is an exchange 
of three intervals with permutation (321) of the form 

{x + \-(l + l)a ifxe[0,[3+(l + l)a-l), 
x + l-{l + 2)a ifx£[/3+(l + l)a-l,a), (4) 
x — a if x G [a, (3) . 
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Theorem 4.5. Let T be an exchange of two intervals as in (1). Let < f3 < 1. 
Then for the interval I = [0, f3) there exist words R and R' , R -< Ul R' , such that 
the I -itinerary R(x) of every x £ L under T satisfies R(x) £ {R, R' , RR'}. 

Proof. Consider first the case when a £ I = [0, /?). The first return map to / is 
given by (4) in Example 4.4 together with the /-itineraries R — 0l l , R' — 1, and 
R" = RR' = 01 i+1 . We can therefore restrict our considerations to an interval 
I = [0, /3) such that a £ I. Let K be an interval K C [0, 1). We distinguish 
three types of events for K. 

Event a) for K occurs if a £ T k (K) for some k > 1. 

Event b) for K occurs if T l (K) n I ^ for some I > 1, while T l (K) <jt I. 

Event c) for K occurs if T m (K) C / for some m > 1. 

Consider first K = I. Since the transformation T is minimal (only trivial 
subsets Ac [0, 1) satisfy T(A) C A), the first event which occurs for K is either 
a) or b). 

Case 1. Let the first event be a). Since the left end-point of K is = T(a), 
necessarily a = T _1 (0) is an inner point of T fe (if). Put I 3 = T- k (T k (L)n[a, 1)) . 
Then T k+1 (L 3 ) C J = [0,/?). Clearly every x € I 3 satisfies r(x) = fc + 1 and 
R(x) is constant on ^3, denote R(x) = R' . Note that the last letter of R' is 1 and 
thus it is lexicographically greater than the /-itinerary R(x) of every x £ I\L 3 . 

Put now K = I \ I3. If for such K the first occurring event is c) then 
the /-itinerary R(x) is constant on I \ I3, say R(x) = R. Wc obviously have 
R 1 = wl, R = wOv for some non-empty words w, v £ {0, 1}*, and thus indeed 
R -< Iox R'. The transformation Tj induced by T on / is the exchange of two 
intervals /1 = L\L% and I3 . 

Suppose, on the other hand, that the first occurring event for K = /\/3 is not 
c) . Then Tj is necessarily an exchange of three intervals and the first occurring 
event is b). For, if it were a), then Tj is not an injective map, a contradiction. 
Put h = T- l (T l (L \ / 3 ) n /). For every x £ h, we have r(x) = I > k + 1, and 
R(x) = R is constant on I\. 

Denote 1% = I \ I%\ 1$. For every x £ I2, we have r(a;) > I. As T/ is an 
exchange of three intervals, we must have r(x) constant on L2, equal to the sum 
of return times for / 3 and /1, namely r(x) = k + 1 + I. Necessarily K = L2 
encounters first the event c) (T k+1+l (L2) C /), and thus also R{x) is constant 
on I2, say R", and we know that it is of length k + I + 1. 

Let us describe /?". By construction, necessarily R is a prefix of R". Consider 
the union / 3 U T (I2). It is an interval, since the right end-point of / 3 is /3, and 
by the definition of h, the left end-point of T l (I 2 ) is /3. Set K = / 3 UT ; (/ 2 ). For 
every point x in /C, the smallest index j such that T 3 (.t) e J is j = k + 1. This 
corresponds to the fact that the first event for K is c), T k+1 {K) = T k+1 (L 3 U 
T l {l2)) C /. We derive that the suffix of R" of length fc + 1 is the same as the 
/-itinerary of points in / 3 , namely R' . Thus, indeed, R" = RR'. 
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Case 2. Let the first event for K = I be 6). Set h = T- l (T l (I) n I). 
Obviously, the return time r(x) = I and the /-itinerary R(x) is constant on I\. 
Denote R — R(x) for x G I\, The /-itinerary of every x G / \ /i has /? as a 
prefix. Thus /? is the smallest among /-itineraries on /. 

For K = I \ h the event a) occurs for some k > I. Set / 3 = T~ k (T k (I \ 
Ii) n [a, 1)). For every x G /3, r(x) = k + 1 and /?(x) = R' is constant on / 3 . 

Put I2 = I \ 1\ \ I3. If I2 = 0, then Xj is the exchange of two intervals I\ 
and /3, and the proof is finished. If not, then the return time r(x) = k + I + 1 
is constant on / 2 , and thus for K = I 2 neither event a) nor event b) may occur 
sooner than c). We have T fe+i+1 (/ 2 ) C / and the /-itinerary on / 2 is also 
constant, say R". Let us describe R" . We already know that R is a prefix of 
R". Consider the union / 3 UT'(/ 2 ). It is an interval, for which first occurs event 
c) with T k+1 (/ 3 U T l {I 2 )) C /. Thus R' is a suffix of R" . 

Case 3. It may happen that for K = I event a) and event &) occur at the 
same time, i.e. k = I. Then we set h = T- l (T l {I)C\l), h = T- k (T k (I)n [a, 1)) 
and / 2 = /\ 1\ \ I3. Denoting the /-itinerary R{x) for x G I\ by /?(a;) = R = w, 
then /?(cc) = R' = vol for x G /3, and /?(x) = /?" = wwl for a; G / 2 . □ 

Remark 4.6. Sofar we have considered the transformation defined on intervals 
closed from the left. In the statement of Theorem 4-5 we could write all intervals 
closed from the right, the result would be the same. 

With regard to Lemma 3.1 and Remark 4.6, Theorem 4.5 has the following 
consequence. 

Corollary 4.7. For every factor w of a Sturmian word u there exist factors 
w\, W2 such that the set of abelian returns to the factor w satisfies A1Z W . u G 
{wi 1 w 2l wiw 2 \. 

Proof. If w is a light factor of u, then the statement is contained in Theorem 4.5. 
If w is heavy, then consider factor E{w) and the Sturmian word E(u) where 
application of E means that we interchange OhI. Thus E(w) is a light factor 
of E(u), for which the statement holds. Clearly, v is an abelian return to w in 
u if and only if E(v) is an abelian return to E(w) in E(u). □ 

Theorem 4.5 thus provides, as a consequence, an alternative proof of what 
has appeared in [15], namely that if a given factor w of a Sturmian word u has 
three abelian return words, then their lengths h, I2, h satisfy h + h = h- 

For calculating the cardinality of AVTZu we need by Corollary 3.2 to deter- 
mine the cardinality of the sets 7?." and TZ'p of abelian returns of all light and 
heavy prefixes, respectively. In fact, it suffices to study abelian returns to light 
prefixes, due to the symmetry mentioned in the proof of Corollary 4.7. The 
following lemma is a consequence of the fact that u is a Sturmian word with 
slope a and intercept p if and only if E(u) is a Sturmian word with slope 1 — a 
and intercept 1 — p. 



10 



Lemma 4.8. Let a,p€ (0,1), a irrational. Then 

n'; = E(n\z a p ) , 

where E : {0, 1}* — > {0, 1}* is determined by E(a) = 1 — a, for a G {0, 1}, i.e. 
interchanging and 1. 

We use the notation (2) and Proposition 4.1. 

Corollary 4.9. Let a, p e (0,1), where a = [0, ai, 02, 03, • ■ • ] is irrational. Let 
k, s S N, 1 < s < afc+i 6e minimal such that p > $& )S . Then 

#Kp = 1 + 01 + a 2 + • • • + a k + s . 

Proof. Let S < 5 be two consecutive values of the form 5k, s- For the interval 
I = [0, $) we have two /-itineraries, i.e. Ss = {R, R'}- According to Theorem 4.1 
and Theorem 4.5, for j3 satisfying 6 < (3 < 5, the set Sp has three elements, 
by Lemma 3.3 not depending on /3, i.e. Sp = {R, R' , RR'}, when R -< lcx R'. 
Moreover, S$ C {R, R' , RR'}. Since shortening of the interval L yields longer 
/-itineraries, necessarily RR' € i.e. S$ = {/?, RR'} or Sg = {R', RR'}. It 
follows that for the description of the set 1Z P , it suffices to find all values 5k, s 
belonging to (p, 1]. In particular, 

4TZ a p =2 + #{5 > p : 5 = 5 itj for some i e N, 1 < j < a,} . (5) 

The summand 2 is obtained as follows: The length 1 = 5o,i provides two 
itineraries, namely and 1. Every next smaller value 5i,j then contributes 
with one more itinerary to TZ p . The last one to contribute is 5k, s for minimal 
indices k, s such that 5k, s < P- d 

Remark 4.10. For any irrational a, if p = 0, then there are infinitely many 
values 5k, s > p. Therefore ^TZq = +00 and hence by Corollary 3.2 also 
^AVlZu = +00 for any Sturmian word u with zero intercept, as shown already 
in [17]. 

Let us calculate several initial values of the decreasing sequence (5k, s ), k > 0, 
1 < s < a k+ i- 

Example 4.11. Let /j, = [0, <Zi, 02, 03, . . . ], with a\ > 2, and v = [0, 1, 62, ^3, • ■ • ]■ 

The sequence 5k, s corresponding to fi has elements 

$0,1 = 1, $0,2 = 1-/", $0,3 = 1-2/4, $0,01 = 1 - (ai - 1)m, 

$1,1=//, $1,2 = (ai + 1)m - 1, $1,3 = (2ai + l)/t - 2, ... 

where clearly 

l>l-/i>l-2/x>--->l-(ai-l)/x >/«>••• (6) 

TTie sequence 5k, s corresponding to v has elements 

$o,i = l, $ M = j/, $i i2 = 2i/-l, $i i3 =3i/-2, 
$1,62=^2^-^2 + !, $2,1 = 1-^, ■■• 
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where 

1 > v > 2u - 1 > • • • > b 2 v - {b 2 - 1) > 1 - v > ■ ■ ■ (7) 
Proposition 4.12. Let a, p G (0, 1), a irrational, u = u a , p . 

1. J/max{a, 1- a} < /? < 1, tten ^ = {0, 1,01}. 

2- If0<p< min{a, 1 - a}, then TZ'° = {0, 1, 10}. 

3. For any p G (0, 1), we /iaue ft£ n ^ = {°> !}■ 

4. for any p G (0,1), we have {0, 1,01, 10} C AVK U . 

Proof. In Example 4.11, consider p = minja, 1 — a} and v = 1 — p = max{a, 1 — 
a}. In both (6) and (7) we see that the second largest (after 6o t i) value of 
the decreasing sequence (<5fe, s ), k > 0, 1 < s < ak+i is max{a, 1 — a}. Re- 
lation (5) then implies that #7\L" =3. In fact, as seen from Theorem 4.5, 
IZp = {0,1,01}, see also Example 4.3. From the proof of Corollary 4.9 it is 
obvious that {0, 1,01} C Tip for any p G (0, 1). By symmetry, we can derive 
for < p < minja, 1 — a} that IZ'^ = {0, 1, 10}, cf. Lemma 4.8, and we have 
{0, 1, 10} C TZ'p for any p € (0, 1). Combining the above and using Corollary 3.2, 
we have statement 4 of the proposition. 

In order to prove statement 3, realize how the /-itineraries of an interval of 
the form / = [0,/3) arise. Directly from the definition of the transformation (1), 
we see that if ft < a, then for every x G [0, /3) the [0, /3)-itinerary R{x) of x has 
the prefix 0. If (3 > a, then the [0, /3)-itinerary of x G [«,/?) is R{x) = 1; for 
every x G [0,a), the [0, /3)-itinerary R(x) of x has the prefix 0. Thus the only 
element of TZ" not having prefix is the word 1. Similarly, the only element of 
lZ p a = E(lZ\Zp) not having prefix 1 is the word 0. The statement follows. □ 

For simplicity of notation, the following theorem is stated for Sturmian words 
whose slope a satisfies a < |. 

Theorem 4.13. Let a, p £ (0,1), a = [0, Oi, a 2 , ■ • • ] irrational, a\ > 2. Let u 
be a Sturmian word with slope a and intercept p. 

(i) Let p G (a, 1 — a). Then 

#AVll u G {ai +3,ai +4}. 

(ii) Let p £ (a, 1 — a). Denote k > and 1 < s < a^+i minimal integers such 
that min{p, 1 — p} > 5k, s - Then 

#AVK U = 2 + oi + • • • + a k + s . 

Proof. We will use the formula AVTl u = 1Z a p \JE{jl\z a p ) (as derived from Corol- 
lary 3.2 combined with Lemma 4.8). Since a = [0, Oi, a 2l • • • ], Oi > 2, we have 
1 — a = [0, 1, Oi — 1, 02, 03, . . . ]. Let us first prove statement (i). Substitution 
p = a and v = 1 — a into the prescriptions (6) and (7) for the sequences 5k. s in 
Example 4.11, we see that they start with the same values 



1 > 1 — a> 1 — 2a> ••• >1— (ai — l)a > a > 
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In order to determine the cardinality of 1Z™ by (5), we find an index i € 
{2, . . . , ai} such that 

l-ia < p < 1 - (i- l)a. (8) 

Then = 2 + i. Obviously, E{lZ\zf) has the same cardinality as n\Z p \ 

which is determined by finding an index I £ {2, . . . , ai} such that 

1 - la < 1 - p < 1 - (I- l)a, (9) 

whence #E(n\z a p ) =2 + 1. 

Since the intersection of Tip* and 1Z'" = E(TZ\Z^) contains by statement 3 of 
Proposition 4.12 exactly two elements, we can conclude that AVTl u = 2 + i + I. 
Let us find the relationship between i and I. 

Inequality (9) can be rewritten as (I — l)a < p < la. Using l/(ai + 1) < 
a < 1/ai we verify that 

(/ - \)a < 1 — (ai — I + l)a < la . 

We have to distinguish two cases. 

a) If (I — l)a < p < 1 — (ai — I + l)a, then i = a x — I + 2, and thus 
#AVK U = ai + 4. 

b) If 1 — (ai — / + l)a < p < la, then i = a\ — I + 1, and consequently 
#ATK U = ai + 3. 

In order to show the second statement of the theorem, consider p ^ (a, 1 — a). 
Let first p < min{a, 1 — a}. From Corollary 4.9, we derive #7?." = 1 + a\ + 
«2 + ■ ■ ■ + a.k + s, where /c > and 1 < s < ak+i are minimal integers such that 
min{p, 1 — p} > 5k, s . By statement 2 of Proposition 4.12, we have #7Z'" = 3. 
Together with statement 3 of Proposition 4.12, we conclude that AP7Z U = 
IZp U VJp = 2 + a\ + a,2 + ■ ■ ■ + ak + s. The proof for p > max{a, 1 — a} is 
similar. □ 



5. Algorithm for finding TZ." 

For the explicit description of the set lZp* , we will exploit Corollary 4.9 and 
its proof. If fco, so are minimal indices such that <5fc , So < p, then the number of 
elements, say TV, in 71" is equal to 

N = 1 + ai + a 2 + • • • + a ko + s . (10) 

The set Tip* contains precisely all [0, <5)-itineraries for every element 6 of the 
sequence (<i>fc iS ), such that 5 > Sk , So - For convenience, we change the indices of 
the elements of the sequence {Sk, s ) to integers, so that the sequence (i5„)„>o be 
strictly decreasing and {5k, s ■ k > N, 1 < s < a k +i} = {5 n ■ n G N}. Clearly, 
5q = 1, Si = max{a, 1 — a}, etc. 
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The construction of [0, <5„)-itineraries for every n g N uses the idea of the 
proof of Corollary 4.9. If 8 > 8 are two consecutive elements of the decreasing 
sequence (8 n ) and R, R' are the /-itineraries for I = [0, 8), then the /-itineraries 
R, R! for / = [0, 8) are chosen from the triple R, R', RR'. 

The following example shows how to choose the pair R, R' from the triple 
R, R' ', RR' when considering 8 = So = 1 and 5 = <5i = max{a, 1 — a}. 

Example 5.1. For / = [0, 1) the I -itineraries are R = 0, R' = 1. FFe wzZZ use 
Example (4.3). Since 8 = max{a, 1 — a}, we distinguish the cases a > \ and 
a < |. 

If a > \, i.e. then 8 — max{a, 1 — a} — a. Substituting (i = a into (3), 
we obtain two [0, 8) -itineraries, R = and R' = 01. and the induced map 
Tj : [0, 8) — > [0, 8) is given by 



Tj(x) 




ifxe [0,2a-l), 
if x € [2a — 1, a) . 



Coding of orbits of points x £ [0, a) under this exchange of two intervals we 
obtain Sturmian words with slope a = . 

// a < £/ien 5 = max{a, 1 — a} = 1 — a. Substituting /3 = 1 — a into (3), 
we obtain two \0 7 8) -itineraries, R' = 1 and /? = 01, and £/ie induced map 
Tj : [0, 5) — > [0, (5) is g^en &y 



IK*) = 




if x e [0,a) , 
if x £ [a, 1 — a) 



Coding of orbits of points x € [0, 1 — a) under this exchange of two intervals we 
obtain Sturmian words with slope a = . 

As we have mentioned, we are interested in /-itineraries for intervals / of 
length (5 = <5„ for some n £ N. In this case the first return map Tj is an exchange 
of two intervals and there are exactly two /-itineraries. We will use the following 
simple statement. 

Fact 5.2. Let J, / be intervals such that J C / C [0, 1) and 

• Tj is an exchange of two intervals and denote by P, P' the two I -itineraries 
under T , 

• (Tj)j is an exchange of two intervals and denote by Q,Q' the two J- 
itineraries under Ti . 

Then Tj is an exchange of two intervals and the two J -itineraries under T are 
constructed from Q,Q' by applying the morphism 



a : {0,1}* -> {0, 1}* defined by a(0) = P, cr(l) = P' . (11) 
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Sofar, the slope a of the Sturmian word was fixed, whence we used the 
symbol T for the transformation without specifying the parameter. In this 
section it will be useful to denote the transformation from (1) by T a . The fact 
that Tj with the domain I C [0, 1) is an exchange of two intervals means that 
there exists a slope a such that Tj is homothetic to T a with the domain [0, 1). 
In Example 5.1 we have described a such that for T a and I = [0, Si) the first 
return map Tf is homothetic to T a . Thus for J = [0, S2), the first return map 
(Tf)j is homothetic to Tf, where J = ±J = j^[0,S 2 ). 

Denote So > Si > S2 > • • • the decreasing sequence corresponding to a. One 
can easily verify that 



Therefore the itineraries when inducing to [0, $2) under T a can be obtained us- 
ing (11) with the knowledge of [0, 5i)-itineraries under T a and [0, (5i)-itineraries 
under T a . It follows that for determining the entire set 72.", it suffices in each 
step to consider the /-itineraries under T £ for / = [0,<5i) for changing value of 
e and S\ = max{e, 1 — e}. The steps are described in the following algorithm. 

Input: a, p € (0, 1), a irrational. 

Output: K™. 

Step 1: Determine TV according to (10). 

Step 2: e := a, R := 0, R' := 1, K := {0, 1}. 

Step 3: Repeat N — 1 times: 

K :=KU{RR'}, 

if e > I then 

R '.= R, R' := RR 1 , e := 2 " e 1 , 

if e < h then 

if RR' -< lcx R' then R := RR', R' := R' else R := R', R' := RR', 
Step 4: TZ a p := TZ. 

The algorithm requires comparison of a with i . This is easy with the knowl- 
edge of the continued fraction of a. The algorithm can be modified in that 
instead of replacing a with a given in Example 5.1, we replace the continued 
fraction of a with the continued fraction of a. In particular, we have the fol- 
lowing prescriptions. Let a = [0, ai, 02, . . .]. 

If a > i, i.e. ai = 1, then 



a = 



2a- 1 



( 



[0,1,02- 1,03,...], ifa 2 >2, 
[0,03 + 1,04,05,...], if a 2 = l. 
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If a < i, i.e. a\ > 2, then 

a . 

a = - = 0,ai - 1, a 2 ,a 3 , . . . J . 

1 — a 

Example 5.3. Consider a — [0, 1,1,1,1,...], i.e. a — ^, where T = i(l + Vo)- 
Table 1 shows several initial values of the decreasing sequence (<5fc, s ), the corre- 
sponding pair of [0, 6) -itineraries R, R' where R -< let R' , and their concatenation 
RR' which is a valid itinerary for the next smaller value 6. The last column 
of the table shows the continued fraction of the slope a. Note that the first 
partial quotient of the continued fraction decides which of the itineraries R or 
R' is included in the pair of [0, S) -itineraries. In particular, if the set S$ of 
[0, S) -itineraries is S$ = {R, R'}, then for the set Si of [0, 5) -itineraries one has 
S s = {R',RR'}, if ax > 2, and S~ s = {R, RR'}, if a x = 1. 
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R 


R' 


RR' 


continued fraction 


So,i = 1 





1 


01 


[0,1,1,1,1,...] 


= i 





01 


001 


[0,2,1,1,1,...] 


$2,1 = £ 


001 


01 


00101 


[0,1,1,1,1,...] 


£3,1 = 73 


001 


00101 


00100101 


[0,2,1,1,1,...] 


^4,1 = ^ 


00100101 


00101 




[0,1,1,1,1,...] 



Table 1: Itineraries for the Sturmian word with slope a = —. 



6. Abelian return words to prefixes of characteristic Sturmian words 

Let us describe the set APlZ Ca of abelian return words to prefixes of a 
characteristic Sturmian word c a . 

Proposition 6.1. Let a = [0, <Zi, 02, • • • ] be an irrational in (0,1). For the 

characteristic Sturmian word c a we have 

1{1, 10,0,01,001, .. .,0 Q2+1 1} otherwise. 

Proof. The characteristic word c a is a Sturmian word with slope a and intercept 
p = 1 - a. By Corollary 3.2 and Remark 4.6, AVTZ Ca = TZf_ a U n[ a _ a . 

Assume that a > i, i.e. 01 = 1. Then 1 — a = 82, 1, and therefore from 
Corollary 4.9, we have #TZf_ a = l+ai+a2 + l = 3+a2- The explicit form of the 
elements of Hi_ a is determined from the algorithm presented in the previous 
section. The values of the decreasing sequence 5k. s to be considered are <5q.i = 1, 
(5i 1 = a, . . . , <5i,a 2 = a 2Ct — (ci2 — 1), 52,i = 1 — a, cf. Example 4.11 
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The sequence of continued fractions corresponding to the induced exchanges 
of intervals defined on [0, S), for 1 — a < S < 1 together with the associated pair 
of [0, (J)-itineraries are given in the following table. 



s 


CF of the slope 


itineraries 


So,i = 1 


[0,1,02,03,04,.-.] 


0,1 


<5i,i = ct 


[0, l,o 2 - l,a 3 ,a 4 ,...] 


0,01 








Sl,a 2 -1 


[0, 1, l,a 3 , 04, . • .] 


O^-H 


<5l,a 2 


[0,a 3 + l,a 4 , . . .] 


o,o a n 


o~2,i = 1 — a 


[0, 03,04, . . .] 


o a n,o a2+1 i 



We derive that Tlf_ a = {0, 1, 01, 001, . . . , O2+1 l}. 

For determining Tl' 1 a _ a , realize that now 1 — a = min{a, 1 — a}, and thus by 
statement 2 of Proposition 4.12, K'*_ a = {0, 1, 10}. Together, 

ATK Ca = {0, 01, 001, . . . , a2+1 l, 1, 10} . 

The result for the case a < \ is obtained from the symmetry between char- 
acteristic words with slopes a and 1 — a. For, if c Q = u$u\U2 ■ ■ ■ then Ci_ a = 
(1 — ito)(l — Ui)(l — U2) ••• , i.e. ci_ Q = E(c a ). The set of abelian returns for 
ci_ Q is A'P7l Cl _ a = {0, 01, 001, . . . , fc2+1 l, 1, 10}, where 62 is a partial quotient 
of 1 — a = [0, 61, &2, &3, • • •]• As for a < |, we have b\ = 1, the continued fraction 
for a is equal to a = [0, 62 + 1, ^3, &4, • ■ •] = [0, a\, 02, 03, . . .], i.e. &2 + 1 = ffli- 
Therefore .4PK Cq = -E(APft Cl _J = {0, 01, 1, 10, 110, . . ., l Ol 0}. □ 

Remark 6.2. Applying Proposition 6.1 to the case a = - = [0, 1, 1, 1, . . .], we 
obtain that the set of abelian returns to the prefixes of the Fibonacci word f = 
Cx/ T is AVlZf = {0,1,01,10,001}, as shown by a different technique (specific 
for the Fibonacci word) in [17]. 

7. Comments and open problems 

1. In the introduction, the authors of [17] mention the connection of lengths of 
palindromic prefixes of the Fibonacci word f to the number of abelian return 
words to prefixes of f . For more detailed observation about this phenomenon 
occurring in characteristic words, let us recall several relevant facts. 

• In their paper [14], Puzynina and Zamboni also show that any abelian 
return v to a factor of a Sturmian word with slope a is a Christoffcl word, 
i.e. v is either a letter or v — awb, a, b £ A where w is a bispecial factor 
of C(a), i.e. such that w;0, wl, Ow, \w € C(a). Indeed, our construction of 
the set 71" described in Section 5 represents a path in the Christoffcl tree. 
For more details about Christoffcl words, see [6]. 



17 



• Any bispecial factor of C(a) is a palindromic prefix of the characteristic 
word c a and vice versa. Therefore we can arrange the palindromic prefixes 
7r„ of c a according to their length, so that n is the empty word and 

K„| < |7T n+ l|. 

• We denote |^ the fc-th convergent of the irrational slope a = [0, a-y, da, . . . ]. 
For any integer n there exist unique k > and 1 < s < au+i such 
that n = ai + 02 + • • • + fflfc + s — 1. It can be shown (cf. Section 3.1 
in [9]) that the length of 7r„ satisfies 2 + |7r„| = sq^ + Qfc-i. Therefore 
if p ^ (min{a, 1 — a},max{a, 1 — a}) one can observe a relationship 
between the length of the longest abelian return in the set AP1Z U and the 
cardinality #AV1Z U , namely that #AV1Z U is equal to 3+ the index of 
the palindromic prefix of c Q of the same length as the longest element of 
#AVK U - 2. 

2. For the determination of the set 1Z'^ we have used the relation 1Z'" = E(lZ\z") , 
which follows easily from the symmetry of the exchange of two intervals. We 
can say even more, considering the following results. 

• Every abelian return is a Christoffel word. 

• Every bispecial factor in C{a) is a palindrom. 

• Every element of 72" of length at least 2 has a prefix and suffix 1, as 
follows from the algorithm, and every element of 1Z'^ of length at least 2 
has prefix 1 and suffix 0, as follows from Tl'f = E{K\z a p ). The n we can 

derive a simple relation 1Z'£* = lZ\Zp, where w stands for the mirror image 
of w. 

3. From the proofs in [14], it follows that in C{a) there exist infinitely many 
factors having two abelian returns and infinitely many factors having three 
abelian returns. 

As for classical return words, Vuillon has shown that infinite words having 
to each factor exactly two returns are Sturmian words. Words with exactly 
three return words to each factor are characterized in [5]. One can ask about 
existence of infinite words where infinitely many factors have two return words 
and infinitely many factors have three return words. 

4. Using Theorem 4.5, we can describe the structure of the set of /-itineraries 
under an exchange T : [0,1) — > [0,1) of two intervals for every subinterval 
I = [0,(3) or I = (7,1]. What are the /-itineraries for a general subinterval 

ic [0,1)? 

5. In [16], the authors study abelian returns for binary rotation words. These infi- 
nite words contain Sturmian words as a special subclass. Rampcrsad et al. point 
out a connection of abelian returns in Sturmian words to the three gap theorem. 
As shown in [10], the three gap theorem is closely connected to maps Tj induced 
by exchange of three intervals with permutation (321). It would be interesting 
to study abelian returns to factors of codings of three interval exchange maps. 
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Let us note that words arising as codings of such 3-interval exchanges satisfying 
Keane's condition i.d.o.c. have unbounded balances (see [1]), and thus also un- 
bounded abclian complexity. Rigo et al. [17] show the connection of finiteness 
of the set AV7Z U with boundedness of abelian complexity of the word u. We 
thus know directly that 3iet words with i.d.o.c. satisfy APlZu = +oo. 
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